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5 

ECHO DETECTOR HAVING CORRELATOR WITH PREPROCESSING 
FIELD OF THE INVENTION 

The invention relates to methods and apparatus for detecting delays in echoes, to echo 
iO cancellers having such apparatus, to central offices having such echo cancellers for use 

in telecommunications networks, to methods of providing telecommunications services 
using the above, to coiresponding software and to systems incorporating the above, and 
con*esponding methods. 

15 BACKGROUND 

There are various known signal processing methods for deteimining a delay of an echo 
(also termed echo ranging). As well as being useful for echo cancellation, it is useful in 
a wide variety of applications such as geology, oceanometry, mobile GSM and CDMA, 
radar target locating, underwater object location, and in particular at receivers of 

20 antenna arrays when multi-path reflections are detected. For the different applications, 
the principles are the same but implementation requirements or constraints may differ. 
In case of the geology or seismology, for mapping underground strata for oil & gas 
sources, speed of processing or calculation complexity arje not so important. In contrast, 
. real-time processing is needed for applications such as mobile network antennas, or 

25 target location in a radar system. For echo cancellation (EC) of telecommunication 
signals such as telephone calls, the main constraints are speed of detection and 
computational load, and at the same time accuracy and reliability are also as important. 
These requirements make the design very demanding. 

30 Two main factors contribute to the generation of echo: acoustic echo between the 
earphone and microphone of a telephone set, and electrical echo generated in the 
transmission systems for the transmit and receive directions of the connection. 
Hybrid circuits (two-wire to four-wire transformers) located at terminal exchanges or in 
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remote subscriber stages of a fixed network are the principal sources of electrical echo. 
Subscriber lines in a fixed network are normally two-wire lines for reasons of economy. 
Interexchange lines, on the other hand, are four-wire lines. 

5 There are two principal known types of delay detection algorithms for such 

applications, adaptive filtering and algorithms using a subband filter with a sine 
function. An example of the former, adaptive filtering for echo delay estimation, or 
echo location, is shown in Canadian Patent application 02319639 (Popovic). This uses 
an adaptive filter to model the echo using a downsampled version of the echo source. 

10 The adaptive filter generates an aliased transfer function having peaks corresponding to 
the delay of the echo. An example of the latter is shown in "implementation and 
evaluation of a real-time line echo bulk-delay estimator*' by Ramkumar et al, published 
in 2000 by IEEE, ref 0-7803-6293-4/00, 

15 Although echo cancellation systems have been implemented without delay 

detection, it will play an ever more important role in echo canceller (EC) systems for 
the following reasons. The actual echo delay can be exaggerated by newer cascaded 
networks. For example in cases of ATM (Asynchronous Transfer Mode) or GSM 
(Global System for Mobile) networks, the round trip could be 80-200ms. This would 

20 make the echo more pronounced and more noticeable. In such case if the echo tails are 
delayed beyond the scope of the EC capability, and the EC is not switched off 
temporarily, it could have very undesirable effects such as distortion, or additional 
unpleasant sounds, such as howling. In other words the EC as a system may rapidly 
become unstable. 

25 

There are two main objectives in detecting the delay, which are: 
« determining if the delay is within the required range, so that amongst other 
things, the EC can be adapted or switched off rapidly if it is out of range, and 

• if within range, estimating a value within ii ms for the echo delay, so that the 
30 EC can be adapted more effectively. 
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The known methods do not achieve these or do not achieve them with sufficiently 
low computational load, or short processing time, with appropriate accuracy and 
reliability. 

5 It is known in principle from US patent 4.577,309 (Barazeche et al.) to use 

correlation to determine echo delay, but only for the purpose of setting a variable delay 
in the canceller, and only by adding special autocorrelation signals. US patent 
5,737,410 (Vahatalo et al.) shows correlating speech signals to find the echo delay 
without adding auto correlation signals. It shows reducing the amount of calculation in 

10 such a correlation by using a sliding window of samples. This is achieved by updating 
the echo signal for each correlation, but not updating the original signal. Thus each 
succeeding correlation is based on an increasingly delayed version of the original signal. 
This leads to reduced calculation because a shorter window of samples can be used for 
the correlation, while still detecting delays much longer in duration than the window, as 

15 the window effectively slides with time, compared to the fixed window of samples of 
the original signal. At longer intervals, this fixed window is updated. This is said to be 
better than using subsampling to reduce the calculation, since siibsampling reduces the 
accuracy of the delay value. 

20 SUMMARY OF THE INVENTION 

It is an object of the invention to provide improved apparatus or methods. According to 
a first aspect of the invention, there is provided an echo detector having: 
a correlator for determining a correlation between an incoming signal an echo signal 
having an echo of the incoming signal, and 

25 a preprocesssor for extracting a characteristic pattern from the incoming signal and a 
characteristic pattern from the echo signal, 

the correlation being determined by correlating the characteristic patterns, and the 
detector being arranged to detect the echo from peaks in the correlation. 

30 The term characteristic pattern is not intended to encompass a low pass filtered version 
of the original signal such as that obtained in the above mentioned prior art by down 
sampling. Advantages of using a characteristic pattern for the correlation can include 
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improved robustness to noise, reduced computational load, or larger correlation 
window size. These advantages can arise because the characteristic pattern is easier to 
correlate than the original signal The pattern can contain more of the information in the 
signal which is useful for the correlation, and less of the information which is not useful 
5 for the correlation. As a result, the correlation output can provide clearer, more distinct 
peaks, with fewer false peaks to mask the peaks resulting from the echoes. Also, this 
can enable more down sampling before correlation, to reduce the computation load of 
the correlation, which is particularly useful for coarse estimation of echo delay. 

10 As a preferred additional feature, the characteristic patterns can be based on a quantised 
average of the respective signal or its power. In this case, the quantising serves to 
enhance the correlation peaks, and the averaging can serve to limit the number of 
transitions, to give more robustness to noise. 

15 As another preferred feature,.the quantisation can be into a binary form, using a 
threshold. 

As another preferred feature, the characteristic patterns are down sampled before the 
correlation. This can reduce the computation load. In many cases, such patterns can be 
20 down sampled much more than the original signals without undue loss of reliability of 
correlation. Such down sampling still causes a loss of precision of echo delay 
estimation, and so may be limited in some applications. 

As a preferred additional feature, the detector is arranged to determine a delay of the 
25 echo from the correlation. 

As another prefened additional feature, the correlation is arranged to be carried out 
recursively, using results obtained for preceding correlation values for determining a 
current correlation value. 

30 

An advantage of such recursive operation is that a dramatic drop in calculation is 
possible. In one example, the amount of calculation drops from being proportional to 
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N*N, to being proportional to 2N, where N is a number of samples in a correlation 
window. N can be a value in the order of hundreds in a typical application. 
Consequently there may be a corresponding reduction in computation delay. Depending 
on the recursive algorithm, the load may now be proportional to the number of samples 
5 correlated, rather than the square of the number of samples. Again, depending on the 
recursive algorithm, there may be some loss of accuracy, especially in the first few 
frames if the algorithm produces an approximated correlation value rather than precisely 
the same result as a non recursive algorithm. For many applications, the dramatic drop 
in computational load, and thus faster output of results is much more valuable. 

10 

As an additional preferred feature, the correlator is arranged to derive a current 
correlation component by correlating a current sample value of the characteristic pattern 
of the echo signal and a window of samples of the characteristic pattern of the incoming 
signal. 

15 

This feature helps enables the computational load to be kept low, by reducing the 
number of multiplications. 

As a preferred additional feature, the characteristic patiems can be based on zero 
20 crossings of the respective signal. This can also enable the con'elation peaks to be 
enhanced. 

As a preferred additional feature, the characteristic patterns can be based on auto 
coiTelation functions of the respective signal. This can also enable the correlation 
25 peaks to be enhanced. 

As a preferred additional feature, the characteristic patterns can be based on pitch. An 
advantage of using pitch for the correlation is that it can enable a more accurate and 
reliable detection of the echo. In particular, the detection can be more independent of 
30 signal level, more robust in the presence of noise. Altematively, the correlation update 
rate can be reduced further, since human pitch normally does not change as rapidly as 
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the entire signal. Typically it will not change over periods of 25-60 msecs This 
can outweigh the additional computational load and delay involved in determining the 
pitch. 

The term correlator here is intended to encompass anything which determines a 
5 similarity between the pitch of the two signals. Where the signal is an unvoiced part of 
speech, with no pitch, this still can be used for the correlation, since the correlator can 
distinguish occurrences of similar lack of signal from non similar higher level signals. 

Another aspect of the invention provides an echo canceller having such a detector. As a 
10 preferred additional feature, the canceller is arranged to be suppressed depending on an 
output of the echo detector. 

As a preferred additional feature, the canceller is arranged to adapt depending on an 
output of the echo detector. 

15 V 

As a preferred additional feature, the canceller has an adaptable range of echo delay, the 
range being adaptable depending on an output of the echo detector. 

As a preferred additional feature, the detector or the echo canceller is in the form of 
20 software. This recognises the value of software as a component which may have great 
value and be independently traded, separately to hardware components. 

Another aspect of the invention provides central office apparatus having the above echo 
canceller. Another aspect of the invention provides a method of providing a 
25 telecommunications service to subscribers, over a network, and using the above central 
office apparatus. 

Another aspect provides an echo detector having a coarse echo delay estimator and a 
fine echo detector, each arranged to detect echoes having delays within a given range, 
30 the range of the coarse detector being wider than the range of the fine detector, the 

range of the fine detector being adjustable according to an echo delay determined by the 
coarse echo delay estimator. 
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An advantage of this is that a better balance between accuracy of detection, extent of 
range, computational load and computational delay, can be achieved. In particular, this 
can enable the range of the fine detector to be reduced, thus reducing computational 
5 load and delay, and/or improving accuracy. 

As a preferred additional feature, a centre of the range of the fine detector is adjustable. 

As an additional feature the coarse echo delay estimator has a correlator for determining 
10 a correlation between an incoming signal an echo signal having an echo of the incoming 
signal, and a preprocesssor for extracting a characteristic pattern from the incoming 
signal and a characteristic pattern from the echo signal, 

the correlation being determined by correlating the extracted characteristic patterns, and 
the detector being arranged to detect the echo from peaks in the correlation. 

As another preferred feature, the characteristic pattern is based on a quantised average 
of the signal or its power. 

As another preferred feature, a delay value output by the coarse echo delay estimator is 
20 compared to a current range of the fine echo detector, and the result is used to confirm 
an echo detection output of the fine echo detector. 

Another aspect provides an echo canceller having the above echo detector, and arranged 
to suppress a cancellation action on the basis of an output of the coarse echo delay 
25 estimator. 

As a preferred additional feature, the canceller has a filler for filtering the incoming 
signal to model the echo, a centre of a window of the filter being adjustable on the basis 
of an output of the coarse echo delay estimator. 

30 

As a preferred additional feature, the centre of the window is adjustable by providing an 
adjustable delay for delaying the incoming signal before it is input to the filter. 
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Other aspects provide for methods or software corresponding to any of the apparatus or 
system aspects, or combinations or components of the above aspects. Other advantages 
than those set out above may be apparent to those skilled in the art, particularly over 
5 other prior art of which the inventor is not yet aware. The features of dependent claims 
within each aspect can be Combined with each other or with other aspects of the 
invention as would be apparent to those skilled in the art. 

BRffiF DESCRIPTION OF THE FIGURES 
10 Embodiments of the invention will now be described with reference to the figures as 
follows: 

Fig 1 shows in schematic form echo cancellers in a known network, 

Fig 2 shows in schematic form an arrangement of an echo canceller having a delay 

detector, 

15 Fig 3 shows a delay detector,.according to an embodiment of the invention, having a 
correlator using extracted characteristic patterns, 

Fig 4 shows an embodiment in which the extracted patterns are quantised average 
powers. 

Fig 5 shows another embodiment showing how the quantised average pov/er can be 
20 extracted and used, 

Fig 6 shows another embodiment in which the extracted patterns are zero crossings 
Fig 7 shows another embodiment in which the extracted patterns are auto correlation 
functions, 

Fig 8 shows another embodiment in which the extracted patterns are the pitch of the 
25 signals, 

Fig 9 shows an example of how the pitch can be extracted. 

Fig 10 shows an embodiment of an echo canceller having a coarse delay estimator and a 
fine echo delay detector, 

Fig 11 shows more details of a delay detector implementation according to another 
30 embodiment, 

Fig 12 shows more details of an implementation of the recursive algorithm shown in 
Figure 11, and 
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Fig 13 shows an example of how to implement a classifier for processing the 

correlation results* 

DETAn.FD DESCRIPTION 

5 

Figure L showing how echo cancellers are used in conventional telephone networks 
Figure 1 shows an application of the echo canceller of the invention in a conventional 
telephone network. In this figure, a long-distance telephone network 50 is shown, for 
making a telephone call from one subscriber to another. For convenience, one side of 

10 the network is denoted the near end, and the other side is denoted the far end. A 
subscriber's handset 90 is coupled to a private branch exchange (P B X) by a 2-vvire 
subscriber line 45. In the P B X, a hybrid coil 60 is used to convert between the two 
wire subscriber line and a 4 - wire line to the Central Office or local exchange 5 1 . The 
conversion to4-wire enables the voice signals in two directions to be a separated, which 

15 is useful for digitising and further processing. Each P B X may support tens or hundreds 
of subscribers, and will have sufficient hybrid coils according to how many calls are to 
be supported simultaneously. 

Connections from many PBXs and many subscriber lines may be concentrated at a 
20 Central Office 10, which maybe many miles away from the subscriber. The central 
office contains the echo canceller 70. and a switch 80. For the sake of clarity, many 
other functions of the Central Office are not illustrated. There may be many echo 
cancellers provided, according to how many calls are to be handled simultaneously. 
Conventionally, each Central Office concentrates many calls on to one or more or trunk 
25 routes 130 which make up the long distance telephone network 50. At the far end, 
similar elements and functions are provided. A far end Central Office 52 contains an 
echo canceller 110 and a switch 100. 4-wire lines 150 are provide to connect the Central 
Office to one or more P B Xs 53. Each will contain a hybrid 120. Two-wire subscriber 
lines 160 couple handsets 165 to the hybrid. 

30 

As the echo cancellers are intended to cancel echoes arising from the hybrids at each 
end of the circuit, in principle, they can be located anywhere in between the hybrids. 
They are in practice usually located in a central office where many lines are switched 
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and concentrated. This is convenient to enable them to be shared to make more 
efficient use of limited processing resource, and for ease of access. 

Figure 2. showing principal elements of an echo canceller with a delay detector DP 
5 In figure 2 a near end signal x(n) has an echo added by a hybrid 60. The hybrid has an 
impulse response h(n). An adder 210 is shown to represent schematically the addition 
of the echo r(n) to the near end signal, resulting in an echo signal s(n). The principal 
elements of the echo cancellor shown in figure 2 are an adaptive filler 200, a double 
talk/EPC (echo path change) control element 230, a delay detector (DD) 240, and 
10 bypass switch 250, and a subtractor 220, The adaptive filter creates a model echo r(n) 
from an incoming signal y(n) also termed a far end signal. The model echo and the 
echo signal are fed to the subtractor. The model echo is subtracted from the echo signal 
to produce an echo cancelled signal e(n). 

15 The bypass switch 250 enables the echo signal to bypass the subtractor. This can be 
achieved by a switch before the subtractor, or a switch after the subtractor, as would be 
well known to those skilled in the art. The bypass switch is controlled by the delay 
detector or the double talk/EPC control element. In both cases, the purpose is to avoid 
or reduce distortion in particular circumstances. Other detectors not shown may also 

20 trigger a bypass, such as tone detectors. The delay detector can also be arranged to 
influence the adaptive filter. 

The delay detector is arranged to receive the echo signal from the near end and the 
incoming signal from the far end. It is arranged to carry out the correlation between 
25 these signals to detect echoes and determine echo delays. Some examples of delay 
detectors will now be described in more detail. 

Figure 3. showing an embodiment of the delay detector having a characteristic pattern 
extractor, 

30 Figure 3 shows in schematic form principal elements of a delay detector according to an 
embodiment of the invention. This embodiment can be used to implement the delay 
detector of figure 2. The incoming signal and the echo signal are fed to preprocessors in 
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the form of extractors 72. 74 for extracting a characteristic pattern from the echo signal 
and the incoming signal respectively. The values of the characteristic pattern extracted, 
are fed to a correlator 68. The output of the correlator is fed to a detector 59 for 
detecting consistent peaks in the correlation. The consistent peaks provide an indication 
5 of echoes, and the index of each consistent peak indicates the echo delay for each echo. 

Figs 4.5. embodiments using correlation of quantised average power 

Figure 4 is as figure 3, but where the characteristic pattern is the quantised average 

power, extracted from the incoming signal and the echo signal respectively by elements 

10 725 and 735. A more detailed embodiment is shown in figure 5. This way or other ways 
can be used to implement the arrangement of figure 4. As shown in figure 5, a 
quantised average power is determined for each of the incoming signal and the echo . 
signal, by elements 600 and 601 respectively. The output of these elements is a binary 
pattern. Components making up this element are shown for element 600 and omitted 

15 for element 601, for the sake.pf clarity. These components include a buffer 610 for 
converting a serial stream of 8kHz samples into a frame.. This is effectively a serial to 
parallel convertor. These vectors are output at the 8kHz sampling rate, (or optionally a 
slower rate) and so each vector contains a fixed number of samples N which appeared in 
the preceding vector, but are shifted one place, together with one new sample. In other 

20 words there is N-I samples of overlap between the vectors. In a typical example, N is 
128. 

Each sample in each vector is squared to make them all positive and proportional to 
signal power, in element 620. As there is so much overlap, the squaring function can be 

25 carried out recursively using the results from the squaring of the preceding vector, and 
adding a square of the most recent sample, and subtracting the square of the oldest 
sample. In element 630, an average value for the whole vector is obtained by summing 
these squares, and dividing by the length of the vector . At element 650, this average 
• value is quantised by threshold element 650. (Optionally this value can be converted to 

30 a dB value before thresholding.) This produces a binary value. In principle, the 

quantisation could be multi level, using multiple thresholds. The threshold level can be 
adaptable, and is selected to give enough transitions to give a good pattern for reliable 
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correlation. In a typical example this threshold could be -40dB, equivalent to a 
barely audible speech level. The averaging over a whole vector of N samples helps 
reduce susceptibility to noise. In principle, the power could be averaged without 
quantisation, or quantised without averaging, but in practice, the combination produces 
5 better results. Also, signal level or some value derived from signal level could be used 
other than signal power. 

Before correlation by element 690, optionally, both quantised average powers can be 
downsampled by element 660, in other words, decimated, to reduce the computational 

10 load in the correlator. The amount of decimation may depend on the accuracy of delay 
value required. For a coarse echo delay detector, decimation may be by a factor of up to 
64, and still have enough information to give reliable correlation. The use of a 
characteristic pattern rather than the original signal enables much more decimation than 
could be achieved using the original signal. This enables the coirelation window to be 

15 much longer for a given computational load. The coirelation window length determines 
the range of echo delays that can be detected, which is a significant constraint in 
previous echo detectors. 

The coirelator 690 can be any conventional correlator, but preferably is a recursive type 
20 as described below in more detail. The output of the coiTelator will be a vector, updated 
at a given rate. In this case if the window length is 128 (decimated) samples (i.e. 
1024msec), whose rate is 8Khz divided by 64 = i28H2, the output correlation vector or 
frame will also be 128 samples long. Element 700 finds consistent peaks in this vector. 
This can be done using the classifier described below with reference to figures 1 1 and 
25 13, or any type of conventional peak detector. The outputs of the peak detector are a 
flag to indicate a peak, and an index indicating the position of the peak value in the 
vector. This index is multiplied by the downsampling factor using element 710, to 
output a delay value. As mentioned above, these outputs can be used in many 
applications. In echo cancelling applications, they can be used to vary an adaptive filter, 
30 by varying the coefficients, or delaying the input for example, or suppressing the 
cancellation by a bypass or similar operation. 
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Figs 6 and 7. other examples of characteristic patterns. 

Figure 6 corresponds to figure 4 or figure 3, where the characteristic pattern is another 
quantized pattern, this time based on zero crossings of the signals. Corresponding . 
reference numerals have been used as appropriate. A zero crossing extractor 745 
operates on the incoming signal, and a similar extractor 750 operates on the echo signal. 
Figure 7 also corresponds to figure 4 or figure 3, where the characteristic pattern is 
based on an auto correlation function. This may be thresholded to give a quantised 
pattern. Coiresponding reference numerals have been used as appropriate. A first auto 
correlation function 755 operates on the incoming signal, and a similar one 760 operates 
on the echo signal. These auto correlation functions can be implemented following 
established design principles. Other types of characteristic patterns can be conceived 
with similar advantages. 

Figure 8, showing an embodiment of the delav detector having a pitch extractor. 
Figure 8 shows in schematic Xoim principal elements of a delay detector according to an 
embodiment of the invention. This embodiment can be used to implement the delay 
detector of figure 2. The incoming signal and the echo signal are fed to extractors 770, 
765 for extracting a pitch value from the echo signal and the incoming signal 
respectively. The values of the pitch, are fed to a correlator 68, The output of the 
correlator is fed to a detector 59 for detecting peaks in the coirelation. The consistent 
peaks provide an indication of echoes, and the index of each peak indicates the echo 
delay for each echo. 

Pitch Extraction 

As described in US patent 6,035,271 to Chen, pitch extraction has been well known as 
an essential part of speech signal processing for decades. Traditionally, pitch is defined 
as the fundamental frequency of the voiced sections of a speech signal. Three methods 
are typically used for pitch extraction: autocorrelation, cepstrum, and subharmonic 
summation, as described in text books on the subject such as Wolfgang J. Hess, "Pitch 
Determination of Speech Signals Algorithms and Devices" Springer- Verlag, Beriin, 
1983; and L. R. Rabiner and R. W. Schafer, "Digital Processing of Speech Signals", 
published in 1978, ISBN 0-13-213603-1. 
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In the book "digital processing of speech signals" by Rabiner & Schafer, there is 
discussion of pitch period detection at Pages 314-319, 372-379 and 150-158. Various 
ways of determining pitch period are known, including an impulse train algorithm 
5 shown at Page 136, which is very computationally intensive, a Fourier representation 
technique shown at Page 314 onwards, and an auto correlation function approach using 
centre clipping, shown at Pages 150-158. For example, using the autocorrelation 
method, the most prominent peak is identified with pitch. However, in many cases, the 
peak is at frequencies other than the expected pitch. In fact, if a signal contains 
10 sinusoidal components of fundamental frequency and its harmonics, then the peaks in 
the autocon-elation function can be at many different times. 

Figure 9, Pitch Extractor 

An example of how to implement the pitch extraction will now be described with 
15 reference to figure 9. It can be implemented in other ways, as mentioned above. In this 
embodiment, an auto correlation function (ACF) method is preferred. This involves 
comparing the samples from the frame with a delayed version of the same frame, and 
determining how closely they match. The matching can be done by multiplying each 
sample with a respective delayed sample, and summing the products to obtain one 
20 correlation value for each delay. Where there are N samples in the frame, the 

correlations can be carried out with up to 2N different offsets or delays. To obtain a 
complete coirelation profile, a correlation value should be obtained for each of these 2N 
different offsets. This can be implemented using a buffer 405. then feeding a frame to 
the ACF function 410. Any peaks in this correlation profile can represent pitch periods, 
25 if the peaks are above a certain threshold, and the offset or delay that results in the peak, 
is the pitch period value. 

For determining the presence of such pitch periods to detect echoes of voice or tones, it 
is not necessary to calculate the correlation profile over the entire range of offsets. A 
30 range of 3 to 14 milliseconds of offset will be sufficient to cover all the frequencies of 
interest in the speech. Since determining the correlation profile is computationally 
intensive, a valuable reduction in the amount of calculation can be achieved by limiting 
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the range of offset for which the correlation profile is determined. The profile is 
analysed by element 430 to determine if there are any peaks that would indicate the 
presence of a pitch period. Provided the peak has a magnitude of greater than a 
threshold such as one-third of the normalised maximum signal power, determined by 
5 element 440, the decision is made that a pitch period is present in that frame. The 
normalised maximum signal power may be represented as R(0), as shown in element 
420 in the figure. The result in terms of a quantized value for the presence of a pitch 
period is then output, 

10 Where a pitch period has been detected, then a value for the pitch period is output by 
element 460, in terms of the offset , also called index, at which the peak was detected. 
The two pitch period signals extracted from the incoming signal and the echo signal are 
then correlated, to find consistent peaks corresponding to echoes. 

15 Figure 10. canceller having fine echo detector and coarse echo delay estimator 
Figure 10 shows in schematic form elements of an echo canceller according to an 
embodiment of the invention, similar to the arrangement shown in figure 2. 
Corresponding reference numerals have been used where appropriate. The main 
difference is the delay detector, 268 which has a fine echo delay detect element 265 and 

20 a coarse echo delay estimate element 260. The coarse echo estimate element is arranged 
to detemine an echo delay from the echo signal and the incoming signal. This can be 
implemented using a correlation element or optionally using other well known ways for 
estimating delays. An example of how it can be implemented is described above with 
reference to figure 5. 

25 

As shown in the figure, the delay estimate output from this element is fed to a delay 
element 270. to adjust a delay applied to the input of the adaptive filter 200 derived 
from the incoming signal. This has the effect of altering the centre of the filter window, 
enabling modelling of echoes having a range of delays around the centre delay. Other 
30 ways of achieving this are readily available, including altering the filter coefficients 
directly on the basis of the estimated delay value, or altering the filter window such as 
by altering the sampling rate. 
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The outputs of the fine delay are likewise used to control the delay. This can enable 
finer adjustment of the delay. Notably the inputs to the fine echo delay estimate element 
are the echo signal and the delayed version of the incoming signal. By using the 
delayed version, effectively the centre of the range of echo delays detectable by this 
element is shifted. As this shift is controlled principally by the output of the coarse echo 
delay estimate element, this is a way of making the range of the fine detector adjustable 
according to the echo delay, leading to the advantages set out above in the summary of 
invention section. The fine detector may be implemented using a correlation method, or 
alternative known methods. Preferably the coarse estimator has a much wider range, 
and much greater subsampling to reduce computation load, than the fine detector, The 
fine detector is more concerned with accuracy of the delay value, and subsampling 
reduces the accuracy. Since the fine detector has an adaptable centre of its range, the 
breadth of the range can be arranged to be much shorter. This range is proportional to 
the correlation window length. The computational load can be reduced for the fine 
detector by having a much shorter correlation window, instead of subsampling heavily. 

The outputs of the delay detector can be used for adjusting the delay 270, and/or for 
suppressing the cancellation if the delay detector determines there is no echo, or it is out 
of range (also called out of band). In this case, a bypass switch 220 is provided, to 
bypass subtractor 220 but altematives are conceivable, including interrupting the output 
of the adaptive filter for example. For driving the bypass switch, the "Inband/Outband*' 
flag output by the fine detector and the delay value output by the coarse estimator are 
combined as follows. If the fine detector flag shows inband, and this is confirmed by the 
echo delay estimate of the coarse estimator, then the bypass is not operated. If the flag 
indicates outband, and the delay estimate is invalid or out of range, then the bypass is 
operated. If there is a conflict, i.e. inband but out of range, or outband but in range, then 
the default is that the bypass is operated, to avoid disturbing effects such as howling. 

The algorithm works like this: 

Run DD128 (this is the fine detector, with a 128 msec detection range) 
If inbandl28-flag is HI (in) ( i.e. this is the output of the fine detector) 
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Adaptive filter (AF) is ACTIVE 

Else 

Run the long DD1024 (i.e. this is the coarse detector with a 1024ms detection 

range) 

5 If inbandl024 is HI (i.e. there is a delay >128ms but <1024) 

Then the small DD128 can be moved to there, centred or bypassed 
switch if they don't want to use the DD128 

else 

go to bypass switch 

10 

In a typical implementation of a coarse delay estimator using the arrangement of figure 
5, the subsampling factor can be as high as 64, the correlation window can be 128 
samples long. The range can extend to 1024 msecs, well beyond the usual limit of 
128msecs of known systems (this increase the range of the delay detector by factor of 8 
15 with practically the same amount of computation as a 128msec range detector). 

Figure 11, showing a more detailed view of a delay detector. 

More details of one example of how to implement the detector 75 for detecting peaks in 
the correlation, and then classifying the peaks, is shown in Figure 1 L This arrangement 

20 can be used as a complete delay detector, or as a fine detector, or as a coarse delay 
estimator. The choices of characteristic pattern, of correlation frame length and of 
downsampling factor, and of sophistication of the classifier, can be made to suit the 
application. To detect distinct peaks, a time margin around the highest peak can be 
used. A next highest peak outside the time margin is detected. Other peaks within the 

25 time margin are disregarded. This takes account of the common situation of an echo 
causing several closely spaced peaks in the correlation. The widely spaced peaks can 
represent echoes with widely differing delays. The peaks are fed to a classifier for 
classifying the peaks as shown in figure 1 1. In particular, the classification may include 
whether the peaks represent an echo which is suitable for cancellation, or periodic 

30 signals or other artefacts which might cause the echo canceller to become unstable, or 
degrade its output. Again, various implementations of each of these elements are 
possible, as will be explained in more detail below. 
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In figure 11, inputs to the delay detector include an echo signal from the near end (NE), 
and an incoming signal from the far end (FE). The outputs include an inband / outband 
flag to indicate an echo which is suitable for cancellation, and one or more delay values 

5 corresponding to the echo or echoes detected. The echo signal from the near end (NE) 
is fed to a characteristic pattern extractor 11, The output is downsampled (decimated) 
by element 500 (having a decimation factor D, which can be 4 in one example). If the 
decimation is by a factor of 4, then for an original sampling rate of SkHz, becomes a 
decimated sampling rate of 2kHz. This enables a frequency range of IkHz to be 

10 represented. 

The incoming signal is similarly fed to a characteristic pattern extractor 10. The pattern 
output is fed to a downsampling element 510. Both downsampled characteristic 
patterns are fed to a recursive correlator 16 to give a cross correlation factor (CCF). In 
15 the case of the incoming sigaal pattern a buffer and overiap function 14 is provided 
before input to the correlator. The buffer function enables a window of consecutive 
samples of the pattern to be presented as a vector to the correlator. The overlap 
indicates that consecutive vectors are formed from overlapping windows of samples, 

20 The echo signal pattern is fed to the correlator via a resampling element 15. This 

provides a delay to synchronise the signals, and reflects the correlation update rate, M 
which may be lower than the decimated sample rate to reduce the calculation load. No 
buffering is provided for this signal, since the recursive coirelator can operate on just 
the current sample value of the characteristic pattern of the echo signal, without 

25 necessarily needing a vector of consecutive samples. An automatic gain control stage 
may be provided, (not shown in this figure). 

A modulus of the vector produced by the correlator is derived and fed to classifying 
stages 17-23,27. The modulus is derived so that positive or negative correlation peaks 
30 have the same effect. An updated vector is fed to the classifying stages every M 

samples. For each vector, first, a maximum and index finder 17, is used for determining 
a maximum peak in the entire vector, and a corresponding index (which represents the 
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delay at this peak). These are used to control a series of vector selectors 18, 20, 
22. These are used to determine the next highest peaks in parts of the correlation away 
from the highest peak. These parts may be omitted optionally, if it is not desired to track 
multiple echoes, by detecting multiple peaks. The vector may be divided into quarters or 
5 some other fraction. A first quarter can be centred on the index of the maximum peak, 
and other quarters or parts of quarters derived from the index of the maximum peak. 
Vector selectors 18, 20, 22 and so on are each used to carry out this segmentation, and 
feed the selected part of the vector to their associated maximum and index elements 19. 
21,23. 

10 

These elements determine the highest peak in their quarter, or part of the vector, and the 
corresponding index value. This segmentation of the vector enables search time for the 
next highest peaks, and computational load to be reduced. The detailed implementation 
of these elements is a matter of design convenience following established practice. 

15 

The outputs of these elements are fed into a logical state machine 27 which determines 
ratios of magnitudes of successive distinct peaks separated by time margins of R 
samples. (In a typical example, R=15). The ratios are thresholded, and if they show the 
ratio is large enough, the echo is flagged as inband. The flag can also show whether 
20 more than one distinct peak has been identified. The delay value represented by the 
index of the respective peak is derived by determining a product of the decimation 
factor and the index. Where multiple echoes are detected, multiple delay values can be 
output by the logical state machine. 

25 Optionally, the logical state machine may have other inputs for controlling the flag 
output. Examples include the EPC detector 24, the double-talk detector 25 and the EE 
and NE activity detector 26. A tone detector 520 may be useful so that if a tone is 
detected, the state machine should be disabled or indicate that the correlator output 
cannot distinguish between echo or no echo. The flags can be arranged to indicate a 

30 "don't know" condition, and any delay value will be unreliable in this case. Normally 
the classifier will show no echo, because the correlator output will not show a 
sufficiently distinct peak. But this may mask the fact that there is an undetected echo. 
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Recursive Correlator 

The correlator may be a recursive correlator, and such a correlator will now be 
described in more detail. The cross-correlation of the two signals NE and FE is 
5 calculated as in following equation for the correlation vector, termed the cross 
correlation factor ccf . 

/t=0 

The MDPs (Million Instructions per second) consumption of the above equation is of 
10 factor of N (the length of the con-elation window, measured in samples), where in our 

case N=1024, and a sampling rate of 8 kHz, i.e. about 8000 MIPS 

(1024*1024*8000*10**^ - 8388). In order to reduce the computational the following 

measures can be taken: 

a. Use a recursive algorithm for CCF. 
15 b. Sub-rating (also called decimating) the FE and NE signals by a factor of D. 

c. Different updating rate of the correlation, by a factor of M of the decimated 

samples, In other words, successive correlation windows overlap not by all but 1 

decimated sample but by all but M decimated samples. 

20 For example using measures b and c, D = 4 and M = 2, MIPS consumption will be 

reduced to {256*256*8000*10*^(4*2) 65}. This is still an unrealistic computational 
load for many applications. To make above ccf equation realizable a recursive algorithm 
can be used to make a much greater impact on the computational load. An example of a 
recursive algorithm now follows: 

25 

CCFl-l/Q (rL-5(n))+(l-l/0CCFL,^ 
where = y(n) + y(n-l)+A + yiO) 

and CCF[ = ccf{n) + ccf(n - 1) +A + cc/(0) n = 0,1,2,A ,N-l 
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Q is the span factor which represents how much of the preceding ccf to remember. 
Using this equation will reduce the computation drastically. For example with use of D 
= 4, and M = 2, the MIPS consumption will be: { 256*2*8000* 10'^/(4*2) =^ 0.512} 
which is not only realistic, it is a considerable improvement on other known methods. 
5 For a coarse detector with D=64 and M=l. this would give 128*2*8000/64=0.032 MIPS 
which is even more dramatic. 

Figure 12. showing a more detailed view of the recursive correlator. 

In figure 12, reference numerals corresponding to those in figure 11 where appropriate. 

10 The echo signal and incoming signal are processed as shown in figure 1 1 up to the point 
where they are input to the recursive correlator. As shown in figure 12, the incoming 
signal in the form of a vector output by the buffer and overlap element 14 is fed to a 
multiplier 30. The echo signal as output by the resampling element 15 is amplified by a 
constant factor 1/Q in element 31. This result is fed to the multiplier and multiplied by 

15 the incoming signal vector. This produces an instantaneous conelation vector, but only 
in respect of the current echo signal sample. 

Resampling is effectively for synchronising the inputs of the correlator, since the buffer 
and overlap element introduces a delay. Resampling also effectively decimates the 
20 signal further, by a factor of M, to match the reduced rate of the output of the buffer and 
overlap element. 

To correlate a window of echo signal pitch samples with a window of incoming signal 
pitch sample signals, the instantaneous coirelation vector is added to preceding 

25 instantaneous correlation vectors. To achieve this recursive function, an adder 32, an 
amplifier 33 and a delay element 34 are provided. The instantaneous correlation vector 
is added to a delayed version of the output of the adder. The delayed version is 
amplified by a constant factor 1-1/Q before input to the adder. The amplifiers 31 and 33 
may be used for AGC purposes as well. The output of the adder 32 may be used to feed 

30 an element (not shown) for deriving a modulus of the vector. 

Figure 13. showing operation of the classifier. 
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Figure 13 shows in schematic form some of the principal steps in the operation of a 
classifier such as that shown in figure 1 1. As shown in figure 13. the correlation vector 
is first fed to a modulus function 300 so that positive or negative correlation peaks are 
not treated differently. Step 310 takes the modulus and finds a maximum correlation 
value and its corresponding delay. At step 320 the next highest correlation value is 
determined disregarding peaks having delays within a margin around the delay value of 
the highest peak. At step 330, the peaks are classified by taking a ratio of the highest 
correlation value and the next highest correlation value. At step 340. the ratio is 
thresholded and if it exceeds a threshold, at step 350. a delay value is derived and 
output. It is derived by generating a product of the index and the decimation factor D. 

At step 360, the flag is set to inband. Although it is possible to look for further peaks, if 
the threshold ratio is around 1.5 or greater, there will be limited benefit for echo 
cancellation applications at least, in looking for lower peaks. In contrast, if the threshold 
is not exceeded, it is more worthwhile looking for the ratio of the two next highest 
peaks. Hence at step 370, a ratio of the second and third highest peaks is derived. At 
step 380, it is compared to a threshold, typically but not necessarily the same threshold 
as before. If the threshold is exceeded this time, two delay values are output at step 
390, corresponding to the two highest peaks. As before, they are derived by multiplying 
the index value con-esponding to the peak, with the decimation value D. Also, at step 
360, the flag is set to "inband". If the threshold is not exceeded, the flag is set to 
"outband". 

In principle, further echoes could be classified and flagged in a similar manner, though 
with decreasing additional benefit in terms of echo cancellation performance. Other 
applications may benefit from classifying other weaker echoes. 

In other words, the method can be summarised as follows: 

a. Take the modules of the CCF vector, i.e. |CCF| 

b. Find the max index and the value of the max. CCFmaxi index and CCFmaxi, 
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c. Find the second max outside the specific range (R), i.e. CCF|n ± R as 
CCFmax2indcx and CCFniax2- This is done since if the FE and its delayed echo is present 
within the search range, an exponential decay behaviour of CCF per hybrid is expected. 

d. Calculate the ratio and threshold it: CCFmaxiindcx / CCFmax2indcx > RATIO 
5 e. If the statement 'd' is TRUE then it is flagged as INBAND otherwise: 

f. find the third max outside the specific range (R). i.e. CCF|n ± R as CCFmax3index 
and CCFmax3. 

g. Calculate the ratio and threshold it: CCFmax2indcx / CCFmax3index > RATIO. 

h. If the statement 'd* is TRUE then it is flagged as INBAND otherwise flag as 
10 OUTBAND. 

Other variations, comments and remarks 

In a typical application for a fine detector the following parameters can be used for 
searching for an echo delay of up to I28ms (1024ms for a coarse detector): 
15 D = 2-8. i.e. buffer of 128-51.2 samples for a window size N of 1024 samples. (2-64 for 
a coarse detector) 

M = 1-6, i.e. overlap of 250-255 if the buffer is 256 samples. 
Q = 512, (64-512 for a coarse detector) 
RATIO =1.5-2.1, 
20 R = 10-25 samples. 

Advantages found for the given settings include the MIPS and memory consumption 
being very low, i.e. for a fine detector <1 MIPS and memory usage of less than 
Ikwords, where the word size depends on the processor. For a coarse detector these 
25 figures can be <0.05MIPS and <.5kWords. The echo detection is not restricted to 

echoes of speech, it can encompass echoes of other signals such as pulses or tones for 
example. The tones can include signalling tones of single or multiple frequencies, or 
any other types of tones. 

30 The echo canceller can be bypassed depending on the echo detector outputs, or in 

principle, the echo model input to the subtracter can be suppressed. The adaptive filter 
can be adapted using the echo delay values, and the correlation peak values, or shape. 
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Additionally, the far end signal can be delayed before input to the adaptive filter, 
to enable a simpler, faster and more efficient filtering operation. 

In another embodiment of the EC, the delay value from the delay detector would play a 
5 more important rule since, if echo delays (in other words the EC tails) lie in a different 
range, say from 128ms to 256ms (for example) the same 128ms filter length (and thus 
the same computational load) can still be used by just delaying the FE signal by 
estimated delay and putting the 128ms filter on that region or range. This is valid 
providing all of the echoes (hybrid(s) tails) are in the given range i.e. 128-256ms. 

10 

The delay detector and the echo canceller and other functions can be implemented in 
well known programming languages such as C or Ada, or others, as would be well 
known to those skilled in the art. The resulting code can be cross-compiled into a lower 
level language appropriate to run on a DSP. such as the fixed or floating point types 
15 made by TI or Motorola or o^hers, or on a general purpose microprocessor, or any type 
of firmware, or programmable or fixed hardware, or any combination. The software can 
in principle be implemented as instructions or as combinations of data, instructions, 
rules, objects and so on. 

20 Other variations and implementations within the scope of the claims will be apparent to 
those skilled in the art, and are not intended to be excluded. 

As has been described above, an echo detector correlates between an incoming signal an 
echo signal to determine echo delay. A preprocesssor extracts a characteristic pattern 

25 such as a binary pattern derived by thresholding an averaged power of each signal. This 
pattern is down sampled then used for the correlation, and delay is deduced from 
consistent peaks in the correlation. Advantages include reduced computational load, or 
larger correlation window size, because the characteristic pattern is easier to correlate. 
The quantising can enhance the correlation peaks, and the averaging can limit the 

30 number of transitions, to give more robustness to noise. An echo canceller has a coarse 
echo delay estimator and a fine echo delay estimator. The coarse estimated delay is used 
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to adjust a delay detection range of a fine delay detector having a narrow range. The 
output of the fine detector is used to adjust or suppress the adaptive echo canceller. 
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